AMENDMENTS TO THE CLAIMS 



This listing of claims will replace all prior versions, and listings, of claims in the application: 
Listing of claims : 

1. (Original) A method for selecting from a plurality of different siRNAs one or more 
siRNAs for silencing a target gene in an organism, each of said plurality of different siRNAs 
targeting a different target sequence in a transcript of said target gene, said method 
comprising 

(a) ranking said plurality of different siRNAs according to positional base 
compositions of a corresponding targeted sequence motifs in said transcript, wherein each 
said targeted sequence motif comprises at least a portion of the target sequence of the 
corresponding siRNA and/or a second sequence in a sequence region flanking said target 
sequence; and 

(b) selecting one or more siRNAs from said ranked siRNAs. 

2. (Original) The method of claim 1, wherein each said sequence motif comprises said 
target sequence of said targeting siRNA. 

3. (Original) The method of claim 2, wherein said ranking step is carried out by (al) 
determining a score for each said different siRNA, wherein said score is calculated using a 
position-specific score matrix; and (a2) ranking said plurality of different siRNAs according 
to said score. 

4. (Original) The method of claim 3, wherein each said sequence motif is a nucleotide 
sequence of L nucleotides, L being an integer, and wherein said position-specific score matrix 
is {log(e / y//7,y)}, where eg is the weight of nucleotide i at position j\ py is the weight of 
nucleotide i at position j in a random sequence, and i = G, C, A, U(T), j = 7, L. 

5. (Original) The method of claim 3, wherein each said sequence motif is a nucleotide 
sequence of L nucleotides, L being an integer, and wherein said position-specific score matrix 
is {log(e ij/pij)} 9 where e,y is the weight of nucleotide i at position j\ py is the weight of 
nucleotide i at position j in a random sequence, and j = G or C, A, U(T), j = 7, L. 
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6. (Original) The method of claim 5, wherein said score for each said siRNA is 
calculated according to equation 

L 

Score = J] ln(e, / p t ) 

wherein said e t and p t are respectively weights of the nucleotide at position t in said sequence 
motif as determined according to said position-specific score matrix and in a random 
sequence. 

7. (Original) The method of claim 6, wherein each said sequence motif comprises said 
target sequence of said targeting siRNA and at least one flanking sequence. 

8. (Original) The method of claim 7, wherein each said sequence motif comprises said 
target sequence of said targeting siRNA and a 5' flanking sequence and a 3' flanking 
sequence. 

9. (Original) The method of claim 8, wherein said 5' flanking sequence and said 3' 
flanking sequence are each a sequence of D nucleotides, D being an integer. 

10. (Original) The method of claim 9, wherein each said target sequence is a sequence 
of 19 nucleotides, and each said 5' flanking sequence and 3' flanking sequence are a sequence 
of 10 nucleotides. 

1 1 . (Original) The method of claim 8, wherein each said target sequence is a sequence 
of 19 nucleotides, and each said 5' flanking sequence and 3' flanking sequence are a sequence 
of 50 nucleotides. 

12-20 (Canceled). 

21. (Original) The method of claim 1, further comprising a step of selecting one or 
more siRNAs based on silencing specificity, said step of selecting based on silencing 
specificity comprising, (i) for each of said plurality of siRNAs, predicting off-target genes of 
said siRNA from among a plurality of genes, wherein said off-target genes are genes other 
than said target gene and are directly silenced by said siRNA; (ii) ranking said plurality of 
siRNAs according to the number of off-target genes; and (iii) selecting one or more siRNAs 
for which said number of off-target genes is below a given threshold. 
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22. (Original) The method of claim 21, wherein said predicting comprises (il) 
evaluating sequence of each of said plurality of genes based on a predetermined siRNA 
sequence match pattern; and (i2) predicting said gene as an off-target gene if said gene 
comprise a sequence that matches said siRNA based on said sequence match pattern. 

23. (Original) The method of claim 22, wherein said step of evaluating comprises 
identifying an alignment of said siRNA to a sequence in a gene by a low stringency FastA 
alignment. 

24. (Original) The method of claim 23, wherein each said siRNA has L nucleotides in 
its duplex region, and wherein said match pattern is represented by a position match position- 
specific score matrix (pmPSSM), said position match position-specific score matrix 
consisting of weights of different positions in an siRNA to match transcript sequence 
positions in an off-target transcript {P,}, where j = 1, L, Pj is the weight of a match at 
position j. 

25. (Original) The method of claim 24, wherein said step (il) comprises calculating a 
position match score pmScore according to equation 

L 

pmScore = £ln(£, /0.25) 

where £,= Pi if position i is a match and E t = (l-P,)/3 if position i is a mismatch; and said 
step (i2) comprises predicting said gene as an off-target gene if said position match score is 
greater than a given threshold. 

26. (Original) The method of claim 25, wherein L is 19, and wherein said pmPSSM is 
given by Table I. 

27. (Original) The method of claim 26, wherein said plurality of genes comprises all 
known unique genes of said organism other than said target gene. 

28. (Original) The method of claim 10, wherein said position-specific score matrix 
(PSSM) is obtained by a method comprising 

(aa) identifying a plurality of N siRNAs consisting of siRNAs having 19-nucleotide 
duplex region and having a silencing efficacy above a chosen threshold; 
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(bb) identifying for each said siRNA a functional sequence motif, said functional 
sequence motif comprising a 19-nucleotide target sequence of said siRNA and a 10- 
nucleotide 5' flanking sequence and a 10-nucleotide 3' flanking sequence; 

(cc) calculating a frequency matrix {f v } 9 where i = G, C, A, U(T);j = 1, 2, and 
where f 0 is the frequency of the ith nucleotide at the yth position, based on said siRNAs 
functional sequence motifs according to equation 



(dd) determining said PSSM by calculating e,y according to equation 



29. (Original) The method of claim 28, wherein said plurality of N siRNAs target a 
plurality of different genes having different transcript abundances in a cell. 

30. (Original) The method of claim 29, wherein said step (b) is carried out by 
selecting one or more siRNAs having the highest scores. 

31. (Original) The method of claim 29, wherein said step (b) is carried out by 
selecting one or more siRNAs having a score closest to a predetermined value, wherein said 
predetermined value is the score value corresponding to the maximum median silencing 
efficacy of a plurality of siRNA sequence motifs. 

32. (Original) The method of claim 31, wherein said plurality of siRNA sequence 
motifs are sequence motifs in transcript having abundance level of less than about 3-5 copies 
per cell. 

33. (Original) The method of claim 29, wherein said step (b) is carried out by 
selecting one or more siRNAs having a score within a predetermined range, wherein said 
predetermined range is a score range corresponding to a plurality of siRNAs sequence motifs 
having a given level of silencing efficacy. 



where 
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34. (Currently Amended) The method of claim 33, wherein said silencing efficacy is 
above 50% , 75%, or 90% at an siRNA dose of about lOOnM. 

35. (Original) The method of claim 34, wherein said plurality of siRNA sequence 
motifs are sequence motifs in transcript having abundance level of less than about 3-5 copies 
per cell. 

36. (Canceled). 

37. (Currently Amended) The method of any on e of claims 5 1 1 claim 5, wherein said 
position-specific score matrix (PSSM) is obtained by a method comprising 

(aa) initializing said PSSM with random weights; 

(bb) selecting randomly a weight w ;> obtained in (aa); 

(cc) changing the value of said selected weight to generate a test psPSSM comprising 
said selected weight having said changed value; 

(dd) calculating a score for each of a plurality of siRNAs functional sequence motifs 
using said test PSSM according to equation 

L 

Score = ^ln(w k /p k ) 

wherein said and pk are respectively weights of a nucleotide at position k in said functional 
sequence motif and in a random sequence; 

(ee) calculating correlation of said score and a metric of a characteristic of an siRNA 
among said plurality of siRNAs functional sequence motifs; 

(fit) repeating steps (cc)-(ee) for a plurality of different values of said selected weight 
in a given range and retain the value that corresponds to the best correlation for said selected 
weight; and 

(gg) repeating steps (bb)-(ff) for a chosen number of times; thereby determining said 

PSSM. 
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38. (Original) The method of claim 37, further comprising selecting said plurality of 
siRNA functional sequence motifs by a method comprising . 

(i) identifying a plurality of siRNAs consisting of siRNAs having different values in 
said metric; 

(ii) identifying a plurality of siRNA functional sequence motifs each corresponding to 
an siRNA in said plurality of siRNAs. 

39. (Original) The method of claim 38, wherein said characteristic is silencing 
efficacy. 

40. (Original) The method of claim 39, wherein said plurality of N siRNAs target a 
plurality of different genes having different transcript abundances in a cell. 

41. (Original) The method of claim 40, wherein said step (b) is carried out by 
selecting one or more siRNAs having the highest scores. 

42. (Original) The method of claim 40, wherein said step (b) is carried out by 
selecting one or more siRNAs having a score closest to a predetermined value, wherein said 
predetermined value is the score value corresponding to the maximum median silencing 
efficacy of a plurality of siRNA sequence motifs. 

43. (Original) The method of claim 42, wherein said plurality of siRNA sequence 
motifs are sequence motifs in transcript having abundance level of less than about 3-5 copies 
per cell. 

44. (Original) The method of claim 40, wherein said step (bb) is carried out by 
selecting one or more siRNAs having a score within a predetermined range, wherein said 
predetermined range is a score range corresponding to a plurality of siRNAs sequence motifs 
having a given level of silencing efficacy. 

45. (Currently Amended) The method of claim 44, wherein said silencing efficacy is 
above 50% , 75%, or 90% at an siRNA dose of about lOOnM. 

46. (Original) The method of claim 45, wherein said plurality of siRNA sequence 
motifs are sequence motifs in transcript having abundance level of less than about 3-5 copies 
per cell. 
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47. (Canceled). 

48. (Original) The method of claim 37, wherein said position-specific score matrix 
(PSSM) comprises w*, k =7, L, being a difference in probability of finding nucleotide 
G or C at sequence position k between a first type of siRNA and a second type of siRNA, and 
wherein said score for each said strand is calculated according to equation 

Score = y^ j w k . 

49. (Original) The method of claim 48, wherein said first type of siRNA consists of 
one or more siRNAs having silencing efficacy no less than a first threshold and said second 
type of siRNA consists of one or more siRNAs having silencing efficacy less than a second 
threshold. 

50. (Original) The method of claim 49, wherein said difference in probability is 
described by a sum of Gaussian curves, each of said Gaussian curves representing the 
difference in probability of finding a G or C at a different sequence position . 

51. (Original) The method of claim 50, wherein said first and second threshold are 
both 75% at an siRNA dose of lOOnM. 

52. (Original) A method for selecting from a plurality of different siRNAs one or 
more siRNAs for silencing a target gene in an organism, each of said plurality of different 
siRNAs targeting a different target sequence in a transcript of said target gene, said method 
comprising 

(a) ranking said plurality of different siRNAs according to positional base 
composition of reverse complement sequences of sense strands of said siRNAs; and 

(b) selecting one or more siRNAs from said ranked siRNAs. 

53. (Original) The method of claim 52, wherein said ranking step is carried out by 
(al) determining a score for each said different siRNA, wherein said score is calculated using 
a position-specific score matrix; and (a2) ranking said plurality of different siRNAs according 
to said score. 
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54. (Original) The method of claim 53, wherein said siRNA has a nucleotide sequence 
of L nucleotides in its duplex region, L being an integer, wherein said position-specific score 
matrix comprises w*, k =/, L, vt>* being a difference in probability of finding nucleotide G 
or C at sequence position k between reverse complement of sense strand of a first type of 
siRNA and reverse complement of sense strand of a second type of siRNA, and wherein said 
score for each said reverse complement is calculated according to equation 

L 

Score = ^ w k . 

55. (Original) The method of claim 54, wherein said first type of siRNA consists of 
one or more siRNAs having silencing efficacy no less than a first threshold and said second 
type of siRNA consists of one or more siRNAs having silencing efficacy less than a second 
threshold. 

56. (Original) The method of claim 55, wherein said difference in probability is 
described by a sum of Gaussian curves, each of said Gaussian curves representing the 
difference in probability of finding a G or C at a different sequence position . 

57. (Original) The method of claim 56, wherein said first and second threshold are 
both 75% at an siRNA dose of lOOnM. 

58. (Original) A method for selecting from a plurality of different siRNAs one or 
more siRNAs for silencing a target gene in an organism, each of said plurality of different 
siRNAs targeting a different target sequence in a transcript of said target gene, said method 
comprising, (i) for each of said plurality of different siRNAs, predicting off-target genes of 
said siRNA from among a plurality of genes, wherein said off-target genes are genes other 
than said target gene and are directly silenced by said siRNA; (ii) ranking said plurality of 
different siRNAs according to the number of off-target genes; and (iii) selecting one or more 
siRNAs for which said number of off-target genes is below a given threshold. 

59-66 (Canceled). 

67. (Original) A method for determining a base composition position-specific score 
matrix (bsPSSM) {log(e,y/p,y)} for representing base composition patterns of siRNA 
functional sequence motifs of L nucleotides in transcripts, wherein i = G, C, A, U(T) and j = 
1, 2, L 9 and wherein each said siRNA functional sequence motif comprises at least a 
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portion of the target sequence of the corresponding targeting siRNA and/or a sequence in a 
sequence region flanking said target sequence, said method comprising 

(a) identifying a plurality of N different siRNAs consisting of siRN As having a 
silencing efficacy above a chosen threshold; 

(b) identifying a plurality of N corresponding siRN A functional sequence motifs, one 
for each said different siRNA; 

(c) calculating a frequency matrix {/Jy}, where / = G, C, A, U(T);j = 1,2, Z, and 
where f 0 is the frequency of the ith nucleotide at the jth position, based on said plurality of N 
siRNAs functional sequence motifs according to equation 



(d) determining said psPSSM by calculating e,y according to equation 




68. (Original) The method of claim 67, wherein each said siRNA functional motif 
comprises the target sequence of the corresponding targeting siRNA and one or both flanking 
sequences of said target sequence. 

69. (Original) The method of claim 68, wherein each said siRNA has M nucleotides in 
its duplex region, and wherein each said siRNA functional sequence motif consists of an 
siRNA target sequence of M nucleotides, a 5' flanking sequence of £>/ nucleotides and a 3' 
flanking sequence of D2 nucleotides. 

70. (Original) The method of claim 69, wherein each said siRNA has 19 nucleotides 
in its duplex region, and wherein each said siRNA functional sequence motif consists of an 
siRNA target sequence of 19 nucleotides, a 5' flanking sequence of 10 nucleotides and a 3' 
flanking sequence of 10 nucleotides. 



where 
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71. (Original) The method of claim 69, wherein each said siRNA has 19 nucleotides 
in its duplex region, and wherein each said siRNA functional sequence motif consists of an 
siRNA target sequence of 19 nucleotides, a 5' flanking sequence of 50 nucleotides and a 3' 
flanking sequence of 50 nucleotides. 

72. (Original) The method of claim 67, wherein said plurality of N siRNAs each 
targets a gene whose transcript abundance is within a given range in a cell. 

73. (Currently Amended) The method of claim 72, wherein said range is at least about 
5 , 10, or 100 transcripts per cell. 

74. (Original) The method of claim 72, wherein said range is less than about 3-5 
transcripts per cell. 

75. (Currently Amended) The method of any on e of claims 67 71 claim 67, wherein 
said threshold is 50% , 75%, or 90% at an siRNA dose of about lOOnM. 

76. (Canceled). 

77. (Original) A method for determining a base composition position-specific score 
matrix (bsPSSM) {wy} for representing a base composition pattern representing a plurality of 
different siRNA functional sequence motifs of L nucleotides, wherein i = G, C, A, U(T) and j 
= 1, 2, L, and wherein each said siRNA functional sequence motif comprises at least a 
portion of the target sequence of the corresponding targeting siRNA and/or a sequence in a 
sequence region flanking said siRNA target sequence, said method comprising 

(a) initializing said bsPSSM with random weights; 

(b) selecting randomly a weight Wy obtained in (a); 

(c) changing the value of said selected weight to generate a test psPSSM comprising 
said selected weight having said changed value; 

(d) calculating a score for each of said plurality of siRNAs functional sequence motifs 
using said test psPSSM according to equation 

L 

Score = ]T ln( w k I p k ) 
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wherein said m>* and /?* are respectively weights of a nucleotide at position k in said functional 
sequence motif and in a random sequence; 

(e) calculating correlation of said score and a metric characterizing an siRNA among 
said plurality of siRNAs functional sequence motifs; 

(f) repeating steps (c)-(e) for a plurality of different values of said selected weight in a 
given range and retain the value that corresponds to the best correlation for said selected 
weight; and 

(g) repeating steps (b)-(f) for a chosen number of times; thereby determining said 
psPSSM. 

78. (Original) A method for determining a base composition position-specific score 
matrix (bsPSSM) {wy} for representing a base composition pattern representing a plurality of 
different siRNA functional sequence motifs of L nucleotides, wherein j = G/C, A, U(T) and j 
— 7, 2, L, and wherein each said siRNA functional sequence motif comprises a least a 
portion of the target sequence of the corresponding siRNA and/or a sequence in a sequence 
region flanking said siRNA target sequence, said method comprising 

(a) initializing said bsPSSM with random weights; 

(b) randomly selecting a weight Wy obtained in (a); 

(c) changing the value of said selected weight to generate a test psPSSM comprising 
said selected weight having said changed value; 

(d) calculating a score for each of said plurality of siRNA functional sequence motifs 
using said test psPSSM according to equation 

L 

Score = ]Tln(nv / p k ) 

wherein said w k and pk are respectively weights of a nucleotide at position k in said functional 
sequence motif and in a random sequence; 

(e) calculating a correlation of said score and a metric of a characteristic of an siRNA 
among said plurality of siRNAs functional sequence motifs; 
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(f) repeating steps (c)-(e) for a plurality of different values of said selected weight in a 
given range and retain the value that corresponds to the best correlation for said selected 
weight; and 

(g) repeating steps (b)-(f) for a chosen number of times; thereby determining said 
psPSSM. 

79. (Currently Amended) The method of claim 77 [[or 78]], wherein each said siRNA 
functional motif comprises the target sequence of the corresponding targeting siRNA and one 
or both flanking sequences of said target sequence. 

80. (Original) The method of claim 79, further comprising selecting said plurality of 
siRNA functional sequence motifs by a method comprising 

(i) identifying a plurality of siRNAs consisting of siRNAs having different values in 
said metric; 

(ii) identifying a plurality of siRNA functional sequence motifs each corresponding to 
an siRNA in said plurality of siRNAs. 

81. (Original) The method of claim 79, wherein each said siRNA has M nucleotides in 
its duplex region, and wherein each said siRNA functional sequence motif consists of an 
siRNA target sequence of M nucleotides and a Di nucleotide flanking sequence upstream and 
a Dj nucleotide flanking sequence downstream. 

82. (Original) The method of claim 81, wherein each said siRNA has 19 nucleotides 
in its duplex region, and wherein said siRNA functional sequence motif consists of an siRNA 
target sequence of 19 nucleotides and a 10 nucleotide flanking sequence upstream and a 10 
nucleotide flanking sequence downstream. 

83. (Original) The method of claim 82, wherein each said siRNA has 19 nucleotides 
in its duplex region, and wherein said siRNA functional sequence motif consists of an siRNA 
target sequence of 19 nucleotides and a 50 nucleotide flanking sequence upstream and a 50 
nucleotide flanking sequence downstream. 

84. (Original) The method of claim 82, wherein said metric is silencing efficacy. 
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85. (Original) The method of claim 84, wherein said plurality of siRNAs consisting of 
siRNAs targeting genes whose transcript abundance is in a given range in a cell. 

86. (Currently Amended) The method of claim 85, wherein said range is at least about 
5 , 10, or 100 transcripts per cell. 

87. (Original) The method of claim 85, wherein said range is less than about 3-5 
transcripts per cell. 

88. (Currently Amended) The method of any on e of claims 77 7 8 claim 77, wherein 
said threshold is 50% , 75%, or 90% at an siRNA dose of about lOOnM. 

89. (Original) The method of claim 84, further comprising evaluating said psPSSM 
using an ROC (receiver operating characteristic) curve of the sensitivity of said psPSSM vs. 
the non-specificity of said psPSSM curve, said sensitivity of said PSSM being the proportion 
of true positives detected using said psPSSM as a fraction of total true positives, and said 
non-specificity of said PSSM being the proportion of false positives detected using said 
psPSSM as a fraction of total false positives. 

90. (Currently Amended) The method of claim 84, wherein said plurality of siRNA 
functional sequence motifs consists of at least 5 0, at l e ast 100, or at l e ast 200 different 
siRNAs functional sequence motifs. 

91. (Original) The method of claim 84, further comprising testing said psPSSM using 
another plurality of siRNA functional sequence motifs. 

92. (Original) A method for determining a position match position-specific score 
matrix (pmPSSM) {£,} for representing position match pattern of an siRNA of L nucleotides 
with its target sequence in a transcript, wherein E t is a score of a match at position /, j = 7, 2, 

Z,, said method comprising 

(a) identifying a plurality of N siRNA off-target sequences, wherein each said off- 
target sequence is a sequence on which said siRNA exhibits silencing activity; 

(b) calculating a position match weight matrix {/\}, where j = 7, 2, L 9 based on 
said plurality of N siRNAs off-target sequences according to equation 
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where is 1 if k is a match, and is 0 if A: is a mismatch; and 

(c) determining said psPSSM by calculating E t such that £,= Pi if position i is a 
match and £, = (l-P,)/3 if position i is a mismatch. 

93-94 (Canceled). 

95. (Original) A method for evaluating the relative activity of the two strands of an 
siRNA in off-target gene silencing, comprising comparing position specific base composition 
of the sense strand of said siRNA and position specific base composition of the antisense 
strand of said siRNA or reverse complement strand of said sense strand of said siRNA, 
wherein said antisense strand is the guiding strand for targeting the intended target sequence. 

96-113 (Canceled). 
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